1354-1364 Nucleic Acids Research, 2014, Vol. 42, No. 2 
doi:10.1093/nar/gkt964 



Published online 22 October 2013 



Staphylococcus aureus protein SAUGI acts as a 
uracil-DNA glycosylase inhibitor 

Hao-Ching Wang 1,2 , Kai-Cheng Hsu 3 , Jinn-Moon Yang 3,4,5 , Mao-Lun Wu\ 
Tzu-Ping Ko 1 ' 6 , Shen-Rong Lin 3 and Andrew H.-J. Wang 1,2,6 * 

institute of Biological Chemistry, Academia Sinica, Taipei 115, Taiwan, 2 Graduate Institute of Translational 
Medicine, College of Medical Science and Technology, Taipei Medical University, Taipei 110, Taiwan, 
institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu, 30050, Taiwan, 
department of Biological Science and Technology, National Chiao Tung University, Hsinchu, 30050, Taiwan, 
5 Center for Bioinformatics Research, National Chiao Tung University, Hsinchu, 30050, Taiwan and 
6 Core Facilities for Protein Structural Analysis, Academia Sinica, Taipei 115, Taiwan 

Received June 28, 2013; Revised and Accepted September 30, 2013 



ABSTRACT 

DNA mimic proteins are unique factors that control 
the DNA binding activity of target proteins by 
directly occupying their DNA binding sites. The 
extremely divergent amino acid sequences of the 
DNA mimics make these proteins hard to predict, 
and although they are likely to be ubiquitous, to 
date, only a few have been reported and functionally 
analyzed. Here we used a bioinformatic approach to 
look for potential DNA mimic proteins among previ- 
ously reported protein structures. From ~14 candi- 
dates, we selected the Staphylococcus conserved 
hypothetical protein SSP0047, and used proteomic 
and structural approaches to show that it is a novel 
DNA mimic protein. In Staphylococcus aureus, 
we found that this protein acts as a uracil-DNA 
glycosylase inhibitor, and therefore named it 
S. aureus uracil-DNA glycosylase inhibitor (SAUGI). 
We also determined and analyzed the complex 
structure of SAUGI and S. aureus uracil-DNA 
glycosylase (SAUDG). Subsequent BIAcore studies 
further showed that SAUGI has a high binding 
affinity to both S. aureus and human UDG. The two 
uracil-DNA glycosylase inhibitors (UGI and p56) 
previously known to science were both found in 
Bacillus phages, and this is the first report of a 
bacterial DNA mimic that may regulate SAUDG's 
functional roles in DNA repair and host defense. 

INTRODUCTION 

Research in the past decade has revealed several examples 
of regulatory proteins that mimic DNA. These proteins 



use negatively charged amino acids to imitate the charge 
distribution of DNA and thus prevent DNA from binding 
to its original target protein by direct competition (1,2). 
DNA mimic proteins can be found in virus, bacteria and 
eukaryotic cells, and they are involved in many important 
control mechanisms, including DNA repair, restriction, 
transcriptional control and DNA packaging (2). All of 
these observations suggest that DNA mimic proteins are 
essential to living cells, and the discovery of new mimics is 
potentially important in many areas of research. However, 
only a few DNA mimic proteins (<20) have so far been 
reported. The reason is that these proteins are hard to 
identify because their amino acid sequences and protein 
structures are extremely divergent (2). 

We have been developing bioinformatic approaches 
to identify new DNA mimics, and here we found several 
candidates. One of these, the Staphylococcus conserved 
protein SSP0047, was selected for further study. In this 
report, we show that SSP0047 (or SAUGI; for 
Staphylococcus aureus uracil-DNA glycosylase inhibitor) 
acts as a uracil-DNA glycosylase inhibitor that breaks the 
uracil-removing activity of S. aureus uracil-DNA 
glycosylase (SAUDG). We also determined the structure 
of the SAUGI/SAUDG complex, and used surface 
plasmon resonance (BIAcore) to show that SAUGI has 
a high binding affinity to UDG. 

Functionally, UDGs remove the uracils in DNA that 
result from the spontaneous deamination of cytosine or 
the incorporation of dUTP during replication (3,4). To 
date, only two uracil-DNA glycosylase inhibitors (UGI 
and p56) have been identified. One of these, phage PBS2 
UGI, forms a tight and physiologically irreversible 
complex with a variety of UDG proteins in 1:1 molar 
stoichiometry (4-8). The other protein, p56, was identified 
in the Bacillus phage § 29. Although its dimeric structure 
is different from the monomeric UGI, p56 has been shown 
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to inhibit UDG's activity as well (9-12). SAUGI is there- 
fore the third uracil-DNA glycosylase inhibitor that 
has been identified, and the first in a species other than 
bacterial phage. 

MATERIALS AND METHODS 

Bioinformatic search for possible DNA mimic candidates 
in the protein structure database 

To function as a DNA mimic, a protein must have two 
critical properties: a DNA-like arrangement of negative 
charges on its surface and an appropriate structural con- 
formation (2,13). Here, we used these two properties to 
search for potential DNA mimic proteins in the Protein 
Data Bank (PDB). First, we used the 12 known DNA 
mimic proteins listed in Supplementary Table SI (5- 
8,14-24) as starting queries to search on the DALI 
server (25) for proteins with loosely similar structures 
(Z-scqre >4.0 and root-mean-square deviation (RMSD) 
<3.5A; the RMSD is a measure of average deviation in 
distance between the aligned a-carbons in structural 
superimposition, while the Z-Score is a measure of align- 
ment quality, with values above eight indicating good 
structural superimposition). Next, the list of candidate 
proteins was further reduced by applying additional con- 
straints that were deduced from all 12 of the published 
DNA mimic proteins (Supplementary Table SI): (i) a 
protein size of <200 amino acids; (ii) a total of at least 
10 aspartic acid and/or glutamic acid residues on the 
protein surface; and (iii) a negative charge on at least 
10% of the surface residues. Finally, 14 proteins were 
considered potential DNA mimic proteins based on 
the similarity of negative charge distributions to the 
original query proteins (Supplementary Figure SI and 
Supplementary Table S2). 

Preparation and purification of recombinant SAUGI 
and SAUDG 

For N-terminal His 10 -tagged SAUGI, the full-length 
SAUGI gene (NCBI sequence ID: AAL26663.1, amino- 
acid residues 1-112) with the stop codon was ligated into 
pET16b expression vector (Novagen). For C-terminal 
His 6 -tagged SAUDG, the full-length SAUDG gene 
(NCBI sequence ID: YP 040034.1, amino-acid residues 
1-218) and human UDG gene (NCBI sequence ID and 
PDB: 1SSP_E, amino-acid residues 1-223) without the 
stop codon were ligated into pET21b expression vector 
(Novagen). All vectors were transformed into 
Escherichia coll BL21 (DE3), and after the addition 
of 1 mM isopropyl-P-D-thiogalactopyranoside (IPTG), 
the recombinant proteins were expressed for 16 h at 
16°C. Soluble proteins were purified by immobilized 
metal-ion chromatography with a Ni-NTA column, 
followed by gel filtration using Superdex 75 (GE 
Healthcare). 

For tag-free SAUGI, a pET21b expression vector 
containing the full-length SAUGI gene with the stop 
codon was transformed into E. coli BL21 (DE3). After 
addition of 1 mM IPTG, the recombinant SAUGI 
protein without any tag was expressed for 16 h at 16°C. 



Soluble SAUGI was purified by Q anion exchange chro- 
matography (GE Healthcare) with a gradient of 0-0.75 M 
NaCl in 20 mM Tris, pH 7.4, buffer. SAUGFs purity was 
further improved using gel filtration on a Superdex 75 
column. 

His-pulldown assays to identify proteins that interact 
with SAUGI 

For the first His-pulldown assay, the soluble proteins ex- 
tracted from S. aureus (200 ug) were mixed with purified 
N-terminal Hisi 0 -tagged SAUGI (lmg). The reaction 
mixture was adjusted to a final volume of 1 ml in a 
binding buffer of lx phosphate buffered saline with 
20 mM imidazole, and incubated for 16 h at 4°C. 
Incubation then continued with Ni-NTA beads (50 ul) 
for another 1 h. After incubation, the beads were washed 
with 5 ml of the same binding buffer. Proteins bound to 
the beads were then eluted by the addition of buffer con- 
taining 500 mM imidazole and resolved by sodium 
dodecyl sulphate-polyacrylamide gel electrophoresis. 
Protein bands of interest were excised from the gels and 
digested in-gel by trypsin. LC-nanoESI-MS/MS and the 
search program MASCOT were used to identify the 
proteins. In MASCOT, the selected variable modifications 
were methionine oxidation and cysteine carboxyamido- 
methylation. Significant hits (as defined by Mascot prob- 
ability analysis) were regarded as positive identification. In 
another His-pulldown assay to confirm the interaction 
between SAUGI and SAUDG, 3 uM purified C-terminal 
His 6 -tagged SAUDG was mixed with 1 uM untagged 
SAUGI. Subsequent steps in this assay were the same as 
described above. 

Gel nitration to determine the native molecular weights 
of target proteins 

A Superdex 75 10/300 gel filtration column (GE 
Healthcare) was used to estimate the approximate molecu- 
lar weights (MWs) of SAUGI and SAUDG, as well as the 
SAUGI/SAUDG complex. The buffer used in this assay 
was 30 mM Tris-HCl, pH 7.4, 100 mM NaCl, 5% 
glycerol, 1 mM EDTA and 1 mM Dithiothreitol (DTT). 
Gel filtration standard proteins [ovalbumin (43kDa), 
carbonic anhydrase (29kDa), RNase A (13.7 kDa) and 
aprotinin (6.5 kDa)] were used to create a calibration 
curve for protein MWs on the same gel filtration 
column. For each protein, the logarithm of MW was 
plotted against Kav, which was calculated as follows: 
Kav = (Ve-Vo)/(Vt-Vo), where Ve is the elution 
volume, Vo is the column void volume using blue 
dextran 2000 and Vt is the total column bed volume 
(24 ml for the Superdex 75 10/300 gel filtration column). 

Crystallization, data collection and structure 
determination of SAUDG and the SAUDG/SAUGI 
complex 

For SAUDG crystallization, 2ul of purified C-terminal 
His 6 -tagged SAUDG solution (16mg/ml; 20 mM Tris, 
pH 8.0, and lOOmM NaCl) were mixed with 2ul of 
a reservoir containing 0.1 M Hepes sodium, pH 7.5, 
10% 2-propanol and 18% PEG4000 as a precipitant. 
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Table 1. Data collection and refinement statistics of SAUDG and 
SAUGI/SAUDG crystals 
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The PyMOL program (32) was used for the structural 
analyses and also for figure production. 

Determination of the binding affinities between UGIs 
and UDGs 

The binding affinities of SAUGI and UGI to SAUDG and 
human UDG were determined by surface plasmon reson- 
ance using a BIAcore T200 (GE Healthcare) according to 
the protocols provided by the manufacturer. Briefly, the 
anti-His antibody was immobilized on the CM 5 chip 
surface using the standard amine coupling method. The 
C-terminal His 6 -tagged UDG was then immobilized 
on the CM5 chip surface by binding to the anti-His 
antibody. Typically 20-30 response units were 
immobilized on the individual flow cells of the sensor 
chip. Next, SAUGI or UGI in 10 mM Hepes, 150mM 
NaCl and 0.05% v/v Surfactant P20 was injected for 
1 min at a flow rate of 30 til/min. Association (ka) and 
dissociation (kd) kinetic constants were calculated by 
BIAevaluation 3.1 software using a simple 1:1 Langmuir 
model. The equilibrium dissociation constant KD values 
were calculated by the equation KD = kd/ka. 



For crystallization of the SAUDG/SAUGI complex, 1 mg 
of SAUGI and 2.5 mg of C-terminal His 6 -tagged SAUDG 
were mixed in 100 (.tl of complex buffer (30 mM Tris-HCl, 
pH 7.4, 5% glycerol, 100 mM NaCl, ImM EDTA and 
1 mM DTT) at room temperature (RT) for 1 h. A 
Superdex 75 gel filtration column was then used to 
separate the complex from the uncomplexed monomers. 
Subsequently, 2ul of the complex solution (20mg/ml) 
were mixed with 2ul of a reservoir containing 0.1 M 
sodium Hepes, pH 7.2, and 15 % PEG20000 as a precipi- 
tant. Equilibration with the reservoir was achieved by the 
sitting drop method. Before flash cooling, ethylene glycol 
at a final concentration of 15% was added as a cryopro- 
tectant for both the SAUDG and SAUDG/SAUGI 
complex crystals. 

Native X-ray diffraction data from the SAUDG and 
SAUDG/SAUGI complex crystals were collected on 
beamline 13B at the National Synchrotron Radiation 
Research Center in Hsinchu, Taiwan. The data were 
processed using HKL2000 (26). Molecular replacement 
calculations were performed by the program Molrep 

(27) . For SAUDG, the crystal structure of Vibrio 
cholerae uracil-DNA glycosylase (PDB number: 2JHQ) 
was used as a search model, and for the SAUDG/ 
SAUGI complex, a search model was constructed by 
replacing the UGI in the crystal structure of E. coli 
UDG/UGI (PDB number: 2UGI) with the 
Staphylococcus saprophytics hypothetical protein 
SSP0047 (PDB number: 2KCD). Both initial maps were 
produced after preliminary refinement using Refmac (28), 
and the programs Buccaneer (29), Coot (30) and Refmac 

(28) were used in the subsequent model building and 
refinement. Model bias was removed by simulated anneal- 
ing and checking against composite maps calculated 
by CNS (31) using 20 different models. Statistics for 
the data collection and refinement are shown in Table 1. 



Determination of SAUGI's ability to inhibit SAUDG 
activity 

The methods used here to measure the uracil removing 
activity of SAUDG were modified from procedures 
described in previous reports (9,33). For the single- 
stranded DNA (ssDNA) activity of SAUDG, a 40-mer 
Hex-labeled oligodeoxynucleotide containing a single 
uracil (ssDNA19U; 5'-Hex-GTAAAACGACGGCCAG 
TGUATTCGAGCTCGGTACCCGGGG-3') was used 
as the DNA substrate. For SAUDG's double-stranded 
DNA (dsDNA) activity, 40-mer dsDNA fragments were 
prepared by mixing equal volumes and concentrations of 
ssDNA19U and its no-uracil complementary oligonucleo- 
tide (5'-FAM-CCCCGGGTACCGAGCTCGAAGGCA 
CTGGCCGTCGTTTTAC-3'), incubating the reaction 
mixtures at 95°C for 5 min, and then lowering the tem- 
perature to RT for another 30 min. For each reaction, 
different amounts of purified recombinant SAUGI were 
mixed with C-terminal His 6 -tagged SAUDG. The reaction 
mixtures were adjusted to a volume of 9ul in reaction 
buffer (for the ssDNA reaction: 20 mM Tris-HCl, pH 
8.0, 1 mM DTT, 1 mM EDTA; for the dsDNA reaction: 
50 mM potassium acetate, 20 mM Tris-acetate, pH 7.9, 
10 mM magnesium acetate and ImM DTT) and 
preincubated at 37°C for 15 min. Subsequently, 1 of 
reaction buffer containing the respective ssDNA and 
dsDNA substrates was added to a final concentration of 
0.5 uM, and the incubation continued for 15 min. Lastly, 
to cleave the apyrimidinic sites in the ssDNA, NaOH was 
added to a final concentration of 0.2 M and allowed 
to react at 90°C for 30 min, while apurinic-apyrimidinic 
(AP) endonuclease I (4 units, New England Biolabs) was 
added and allowed to react at 37°C for 30 min to cleave 
the apyrimidinic sites in the dsDNA. The dsDNA reaction 
was terminated by raising the temperature to 95°C for 
5 min. The ssDNA and dsDNA reaction products were 
then mixed with 2x sample loading buffer (95% 
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formamide, 20 mM EDTA, 1 mg/ml bromophenol blue) 
and run on a 20% polyacrylamide gel (0.5 x Tris-borate- 
EDTA buffer, 8 M urea). The results were visualized using 
ultraviolet light. 

RESULTS 

Bioinformatic search for DNA mimic candidates from 
the PDB 

In total, 14 potential DNA mimic proteins were identified 
based on the properties of the known DNA mimic 
proteins. The structures and negative charge distributions 
of these proteins are all similar to their respective query 
proteins (Supplementary Figure SI and Supplementary 
Table S2). Among these candidates, the Staphylococcus 
hypothetical protein SSP0047 (PDB number: 2KCD) is 
similar to the uracil-DNA glycosylase inhibitor from 
phage PBS2 (UGI; PDB number: 1UGI) with a Z-score 
of 5.8 and RMSD of 2.9 A. Although there are several 
other candidates with higher Z-scores and/or lower 
RMSD scores, we selected SSP0047 for further study 



Pull- 
down Input 

SAUDGC-His - + + - + 

SAUGI + - + + - 




2X— 0_ 0 <-SAlDG C-His 




Figure 1. The SAUGI-SAUDG interaction confirmed by His- 
pulldown. In this assay, SAUGI without any tag was used as prey 
and C-terminal His 6 -tagged SAUDG was used as bait. The result 
showed that SAUGI was pulled down by C-terminal His 6 -tagged 
SAUDG. 



because the negative charge distribution on its surface 
bears the highest similarity to its original query protein 
(Supplementary Figure SI). Interestingly, although the 
protein fold and charge distributions of SSP0047 and 
UGI are similar, their amino acid sequences show only a 
weak homology (Supplementary Figure S2). The function 
of this protein had not been identified before, but its 
nuclear magnetic resonance (NMR) structure was 
determined by Northeast Structural Genomics 
Consortium (34) and released in 2009. A comparison of 
the distances between the carboxyl groups on the side 
chains of SSP0047 and the phosphate groups on B-form 
DNA suggests that this protein is a DNA mimic 
(Supplementary Figure S3). Taken together, these data 
suggest that SSP0047 may act as a UGI-like uracil-DNA 
glycosylase inhibitor. Since the SSP0047 protein is 
conserved in different Staphylococcus spp, in the following 
experiments we decided to focus on the SSP0047 homolog 
from the important human pathogen S. aureus. The 
protein is subsequently referred to as SAUGI. 

The interaction between SAUGI and SAUDG 

A His pulldown assay was used to identify possible inter- 
acting partners of SAUGI. In this assay, SAUGI was used 
as bait and proteins extracted from S. aureus were used as 
prey. The result showed many S. aureus proteins being 
pulled down by SAUGI (Supplementary Figure S4). Six 
protein bands were selected and identified using mass 
spectrometry. Interestingly, within these bands there 
were at least 8 DNA binding proteins, including DNA 
topoisomerase IV A subunit (GyrA/ParC), catabolite 
control protein A (CcpA), phosphosugar binding tran- 
scriptional regulator (RpiR family), accessory gene regu- 
lator A (agrA), two-component system response regulator 
(WalR/VicR), GTP-sensing transcriptional pleiotropic 
repressor (CodY), DNA damage-inducible repressor 
(LexA) and SAUDG (Supplementary Table S3). Since 
the protein folding of SAUGI is similar to that of UGI 
(Supplementary Figure S2), we hypothesized that 
SAUDG is likely a good target for SAUGI. We therefore 
further investigated the interaction between these two 
proteins. 

First, the interaction between SAUGI and SAUDG was 
reconfirmed by another His pulldown assay, which 
showed that SAUGI without any tag could be pulled 
down by C-terminal His 6 -tagged SAUDG (Figure 1). 



Table 2. The results of gel filtration and analytical ultracentrifugation (AUC) 



Gel filtration Analytical ultracentrifugation 



Theoretical Calculated Calculated S c predicted by Observed S 

MW a (kDa) MW (kDa) MW b (kDa) HYDROPRO from AUC 



SAUGI monomer 14.4 21.2 14.6 1.68 1.79 

SAUDG monomer 26.0 25.3 26.5 2.64 2.68 

SAUGI/SAUDG complex 40.4 36.7 36.6 3.40 3.31 



"The theoretical MW is based on the amino acid sequence. 

b The program SEDFIT (http://www.analyticalultracentrifugation.com) was used to calculate the MWs by using sedimentation 
coefficients from each protein peak. 

"HYDROPRO (35) predicted the S values based on the atomic coordinates reported in this study. 
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SAUGI/SAUDG complex SAUDG only 




Figure 2. Crystal structures of the SAUGI/SAUDG complex and 
SAUDG. (A) Ribbon diagrams of the SAUGI/SAUDG complex and 
SAUDG. The oc-helices and (3-strands are, respectively, colored red 
and yellow in SAUDG and cyan and magenta in SAUGI. The second- 
ary structural elements and the amino acid sequences of SAUGI and 
SAUDG can be found in Supplementary Figure S7. (B) The SAUGI/ 
SAUDG interface. In the magnified inset, selected residues of SAUGI 
and SAUDG are labeled in black and red, respectively. The pink 



Gel filtration chromatography gave calculated MWs for 
SAUDG and the SAUGI/SAUDG complex that were 
close to the respective theoretical MWs of the SAUDG 
monomer and the monomer-to-monomer complex 
(Supplementary Figure S5A and Table 2). The calculated 
MW of SAUGI was ~50% greater than its predicted 
monomer weight, but this was probably due to nonspecific 
interactions between SAUGI and the column material 
(Supplementary Figure S5A and Table 2). Analytical 
ultracentrifugation results for SAUGI and SAUDG 
were both close to the theoretical monomer values, while 
the calculated M W for the complex was within 10% of its 
theoretical MW (Supplementary Figure S5B and Table 2). 
Taken together, these results suggest that SAUGI forms a 
monomer-to-monomer complex with SAUDG that cor- 
responds to the previously reported UGI/UDG complex. 

The crystal structures of SAUDG and SAUGI/SAUDG 

SAUDG has a conserved fold that is similar to the 
reported UDG structures from other species. Structural 
alignment shows that the Leul84 of SAUDG corresponds 
to the protruding residue Leul91 of E. coli UDG, Phel91 
of Bacillus subtilis UDG and Leu272 of human UDG 
(Supplementary Figure S6A). The side chains of these 
residues insert into DNA's minor groove to replace the 
flipped-out uracil, and they are crucial for the formation 
of a stable complex between UDG and its respective sub- 
strate (36,37). Additionally, three loops [4-Pro, Gly-Ser 
and Leu272 loops; (37)] that accomplish compression of 
the phosphates in the human UDG/DNA complex are 
also conserved in these UDG structures (Supplementary 
Figure S6A). Structural alignment of SAUDG with the 
human UDG/DNA complex further suggested the 
probable location of DNA binding region in SAUDG 
(Supplementary Figure S6B). 

The complex crystal structure of SAUGI/SAUDG is 
shown in Figure 2A. SAUGI binds directly to the DNA 
binding region of SAUDG. In the SAUGI/SAUDG inter- 
face, at least 20 water molecules were observed. These 
presumably form a network of hydrogen bonds between 
the proteins, and so far 12 direct hydrogen bonds and 8 
water-mediated bonds have been found (Figure 2B and 
Supplementary Table S4). In addition, the side chain of 
the protruding residue Leul84 in SAUDG is surrounded 
by the side chains of Ile35, Phe69, Leu71, Ile83 and Met89 
in SAUGI, and this provides hydrophobic interactions 
that further stabilize the complex (Figure 2C). Adjacent 
to this hydrophobic region, SAUDG also has Pro82 and 
Prol83 posed against the side chains of Thr55 and Tyr67 
in SAUGI, which contributes further to the nonpolar 
interactions (Figure 2D). Near the center of the interface, 
there are potential van der Waals interactions between the 



Figure 2. Continued 

dashed lines represent hydrogen bonds. Four of the residues on the 
SAUGI strand P-1, Glu24, Cys25, Ser27 and Glu29, are hydrogen- 
bonded to SAUDG, which suggests the importance of strand P-1 for 
SAUGI's binding affinity. (C-E) Other important interactions that 
contribute to the binding between SAUGI (cyan) and SAUDG 
(green). The SAUGI side chains are colored red, while those of 
SAUDG are orange. 
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side chain of His 180 in SAUDG and the facing side chain 
of Ile28 in SAUGI (Figure 2E). Although it lies outside 
the main interface region, the side chain of Arg85 in 
SAUDG is also stacked against that of His62 in SAUGI 
(Figure 2E). All of these interactions contribute to the 
tight binding between SAUGI and SAUDG. 





D 




Figure 3. A comparison of SAUGI-bound SAUDG and unbound 
SAUDG. (A) The SAUGI-bound SAUDG and the unbound 
SAUDG are, respectively, colored green and red. SAUGI is colored 
cyan. The merged image in the center shows the superimposed struc- 
tures of the bound and unbound SAUDG. Conformational differences 
between the bound and unbound SAUDG are seen in the loop region 
of residues 75-80 (B) and in the side chain orientations of Leul84 
(C) and Arg85 (D). Panels (B-D) are in stereo. The 2Fo-Fc maps of 
these regions in SAUGI-bound SAUDG and unbound SAUDG are 
shown in Supplementary Figure S8. 



Conformational changes in SAUDG and SAUGI on 
complex formation 

The binding between SAUGI and SAUDG did not induce 
any large conformational changes in SAUDG (Figure 3A; 
the RMSD fit is 0.345 A). The most significant conform- 
ational change was found in the loop region formed by 
residues 75-80 (Figure 3B), which corresponds to the 
phosphate-compressing 4-Pro loop in the human UDG/ 
DNA complex. The Lys79 side chain on this loop moves 
forward and forms a hydrogen bond with the Asp59 side 
chain of SAUGI (Supplementary Table S4). The side 
chain orientations of Leu 184 and Arg85 on SAUDG 
were also affected by their interactions with the SAUGI 
residues (Figure 3C and D). 

For SAUGI, an RMSD fit of 1.22 A was calculated 
between the unbound SSP0047 (as determined by NMR) 
and SAUDG-bound SAUGI (as determined by radio- 
graphic crystallography). Structural differences were seen 
in two loop regions (Supplementary Figure S9; residues 
41^49; 73-75). However, it is not clear whether these 
differences were due to the binding to SAUDG, or to 
the different determination methods (i.e. NMR versus 
radiography). 

The binding affinity of SAUGI and SAUDG 

The binding affinities between various uracil-DNA 
glycosylase inhibitors and UDGs were measured using 
surface plasmon resonance (BIAcore) (Supplementary 
Figure S10 and Table 3). The results showed that 
SAUGI and SAUDG have rapid association and slow 
disassociation rates. This turns out a high equilibrium dis- 
sociation constant (KD = 1 nM) and implies SAUGI is a 
strong inhibitor of SAUDG. Interestingly, SAUGI also 
binds to human UDG with a high equilibrium dissociation 
constant (KD = 2.5 nM), a relatively slow association rate 
and the same slow disassociation rate. Moreover, for bac- 
terial SAUDG, the binding affinity of UGI is about twice 
that of SAUGI, while for human UDG, the binding 
affinity of UGI is 12-13 times greater than SAUGFs. 

SAUGI inhibits the uracil-removing activity of SAUDG 

To further confirm that SAUGI acts as a uracil-DNA 
glycosylase inhibitor, the SAUDG activity was determined 
by an in vitro glycosylase cleavage assay in the presence or 
absence of SAUGI. In the absence of SAUGI, SAUDG 
removed the uracil from both ssDNA and dsDNA, as 



Table 3. The binding affinities of uracil-DNA glycosylase inhibitors 
and UDGs 



UGIs to UDGs 



ka (1/Ms) 



kd (1/s) 



KD (nM) 



SAUGI to 2472386 ± 16363 0.00296 ± 0.00010 1.198 ±0.029 
SAUDG 

Phage UGI to 8 904442 ± 419216 0.00610 ± 0.00068 0.685 ± 0.080 
SAUDG 

SAUGI to 495685 ± 22047 0.00124 ± 0.00001 2.509 ± 0.088 
human UDG 

Phage UGI to 6 315 673 ± 187 973 0.001 19 ± 0.00010 0.189 ± 0.021 
human UDG 
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Figure 4. Inhibition of SAUDG's uracil-removing activity by SAUGI. (A) ssDNA. (B) dsDNA. Addition of the DNA mimic SAUGI reduced the 
specific uracil-removing activity of SAUDG in a dosage-dependent manner. 



shown by the cleavage at the resulting apyrimidinic site 
when treated with heat and alkali (ssDNA) or with AP 
endonuclease (dsDNA) (Figure 4A lane 2 and Figure 4B 
lane 3). In contrast, the amounts of cleaved product were 
progressively reduced by the addition of increasing 
quantities of SAUGI (Figure 4A and B). SAUDG 
activity was completely inhibited when the molar ratio 
of SAUGI to SAUDG reached 1:1 for the ssDNA assay 
and 2:1 for the dsDNA assay. The slight difference in the 



inhibiting effect of SAUGI on the ssDNA and dsDNA 
uracil-removing activity by SAUDG may be due to the 
different reaction buffers used. 

DISCUSSION 

The two previously known uracil-DNA glycosylase inhibi- 
tors are well-studied DNA mimic proteins that specifically 
inhibit the activity of UDGs. One of these mimics, phage 
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PBS2 UGI, contains a DNA-like surface that binds to the 
DNA binding region of UDG and thus blocks its uracil- 
removing activity (5-8,38). This serves to protect the 
special genomic DNA of PBS2, which is unusual in that 
it incorporates uracil residues instead of thymine residues 
(39). The other uracil-DNA glycosylase inhibitor, 
p56, was identified from Bacillus phage <\> 29 (9-12). 



Interestingly, despite its similar function, the dimeric 
structure of p56 is different from that of the monomeric 
UGI (10). This p56 has later been shown to form a 
complex with both Bacillus UDG and herpes simplex 
virus (HSV) UDG (11,12). Unlike phage PBS2, the 
genomic DNA of <\> 29 does not contain uracil residues, 
but Serrano-Heras (2006) hypothesized that p56 might 
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Figure 5. A structural comparison of SAUGI, UGI, p56 and their respective complexes with UDG. (A) The known uracil-DNA glycosylase 
inhibitors. The os-helices and P-strands are, respectively, colored red and yellow in SAUGI, UGI and one molecule of the p56 dimer, and cyan 
and magenta in the other p56 molecule. (B) A comparison of the UDG complexes with SAUGI, UGI, p56 and DNA. The uracil-DNA glycosylase 
inhibitors and the UDGs are, respectively, colored cyan and green. The protruding residues from the UDGs are shown as yellow sticks. (C) Details 
of the hydrophobic binding pockets in the interacting regions of the three uracil-DNA glycosylase inhibitors. Residues of the protein inhibitors and 
UDGs are shown as red and orange sticks, respectively. 
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knock out the host-encoded UDG base-excision repair 
system because this UDG activity could still be harmful 
to phage replication if uracil residues occur in the replica- 
tive intermediates (40). This hypothesis was recently 
confirmed (41). 

In terms of the protein folding and charge distribution, 
SAUGI is a DNA mimic that resembles UGI rather than 
p56 (Figure 5A). Figure 5B and C further shows how 
SAUGI, UGI and p56 all share a conserved UDG 
binding strategy, which involves targeting their respective 
UDG's protruding residue (i.e. SAUDG Leul84, E. coli 
UDG Leul91 and B. subtilis UDG Phel91; see 
Supplementary Figure S6A) by means of a hydrophobic 
pocket. Interestingly, unlike the conformational changes 




Figure 6. The side-chain carboxyl groups of three uracil-DNA 
glycosylase inhibitors that match the human UDG-bound DNA phos- 
phate backbone. The carboxyl groups on (A) SAUGI, (B) UGI and (C) 
p56 are shown as red spheres, while the protruding UDG residues are 
shown as yellow sticks. The UDG-bound DNA phosphate backbone is 
colored orange. A good correspondence is seen between the several 
carboxyl groups on SAUGI and UGI (i.e. Glu 26/Glu 20, Glu29/Asp 
40, Glu30/Glu 28, Glu 85/Asp 52 and Glu 63/Glu49). All three figures 
are in stereo. 



seen in the loops and P-zippers of human UDG when it 
binds to DNA (37), only minor conformational changes 
are seen when these uracil-DNA glycosylase inhibitors 
bind to their targets (Supplementary Figure Sll). All of 
these uracil-DNA glycosylase inhibitors therefore bind to 
their targets in conformations that are close to the 
unbound conformations. As shown in Figure 6, while 
many of the carboxyl groups on the side chains of 
SAUGI and UGI share similar locations, most of the 
carboxyl groups on the side chains of p56 occupy different 
locations. The matching carboxyl groups on SAUGI and 
UGI surround the hydrophobic pockets that are used to 
bind to SAUDG Leul84 and E. coli UDG Leul91, re- 
spectively, and they may be functionally important for 
these two uracil-DNA glycosylase inhibitors. 

At this time, the biological meaning of SAUGI remains 
unclear. In particular, why would Staphylococcus produce 
a protein that inhibits its own DNA repair system? One 
possible explanation is as follows. It has recently been 
shown that the replication of DNA viruses can be 
blocked by increasing the amount of dUTP in the viral 
genome (42). Many viruses, such as poxviruses, HSV and 
retroviruses, encode dUTPase and/or UDG to prevent 
genomic uracilation (43), while HIV-1 virus also avoids 
uracilation of its genome by recruiting human UDG 
(44,45). Meanwhile, among the 27 bacteriophages that 
infect 5. aureus, genomic information shows that 13 of 
them encode dUTPase (46). This suggests that these 
Staphylococcus phages are sensitive to genomic 
uracilation. By inhibiting UDG activity and thus prevent- 
ing the removal of spontaneously deaminated cytosine 
from phage DNA, SAUGI may act as an anti-phage 
factor in Staphylococcus. However, this hypothesis 
still needs to be verified, and other roles for SAUGI 
are also possible. For example, SAUGI may simply 
act as a regulator that modulates the SAUDG activity 
in Staphylococcus. As a first step in exploring this possi- 
bility, it would be interesting to observe the effects of 
up- or downregulation of SAUGI expression in 
Staphylococcus. 

While the high binding affinity of UGI to human UDG 
might be cytotoxic to human cells, we note that SAUGFs 
relatively low binding affinity to human UDG 
(Supplementary Figure S10 and Table 3) might allow it 
to directly suppress UDG repair activity in S. aureus and 
other human pathogens. In addition, SAUGI-based drugs 
might be used to decrease the replication of viruses such as 
HSV by inhibiting their UDG activity. Lastly, the His- 
pulldown assay showed that SAUGI interacted with 
other DNA binding proteins in addition to SAUDG 
(Supplementary Figure S4 and Supplementary Table S3), 
and we have further confirmed the interaction between 
SAUGI and the SALEX transcription factor (LexA) 
using a bis (sulphosuccinimidyl) suberate (BS3) cross- 
linking assay (Supplementary Figure S12). These results 
suggest that SAUGI may also act to regulate other DNA 
binding proteins. 

In conclusion, this is the first case of a DNA mimic 
protein identified by using a bioinformatic approach, 
and it also represents the first uracil-DNA glycosylase in- 
hibitor from a non-phage species. 
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The atomic coordinates and structure factors have been 
deposited in the Protein Data Bank, www.pdb.org. The 
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